I/" 



Page 1 of 7 




1600 



RAW SEQUENCE LISTING 

PATENT APPLICATION; US/09/684 , 2 15A 



DATE: 04/16/2002 
TIME: 09:31:03 



Input Set : A:\-80-l.app 

Output Set: N:\CRF3\04162002\I684215A.xaw 



<110> APPLICANT: Skeiky, 
Guderian, Jeffrey 
Corixa Corporation 

<120> TITLE OF INVENTION; 



Yasir 



Does Not Comply 
Corrected Diskette Needed 



Methods of Using a Mycobacterium tuberculosis Coding 
Sequence to Facilitate Stable and High Yield Expression 
of Heterologous Proteins 
11 <130> FILE REFERENCE: 014058-008010US 

13 <140> CURRENT APPLICATION NUMBER: US 09/684 , 215A 

14 <141> CURRENT FILING DATE: 2000-10-06 

16 <150> PRIOR APPLICATION NUMBER: US 60/158,585 

17 <151> PRIOR FILING DATE: 1999-10-07 
19 <160> NUMBER OF SEQ ID NOS : 22 

21 <170> SOFTWARE: Patentln Ver. 2.1 

23 <210> SEQ ID NO: 1 

24 <211> LENGTH: 1872 

25 <212> TYPE: DNA 

26 <213> ORGANISM: Mycobacterium tuberculosis 
2 8 <220> FEATURE: 

29 <223> OTHER INFORMATION: 32 KD serine protease MTB32A 

31 <220> FEATURE: 

32 <221> NAME/KEY: CDS 

33 <222> LOCATION: (89).. (1156) 

34 <223> OTHER INFORMATION: MTB32A 

36 <220> FEATURE: 

37 <221> NAME/KEY: sig_peptide 

38 <222> LOCATION: (89).. (184) 

39 <223> OTHER INFORMATION: N-terminal hydrophobic secretory signal sequence 
41 <220> FEATURE: 

mat_peptide 
(185) . . (1153) 
1 



42 <221> NAME/KEY 

43 <222> LOCATION 



45 <400> SEQUENCE 

46 gactacgttg gtgtagaaaa atcctgccgc ccggaccctt aaggctggga caatttctga 60 
4 8 tagctacccc gacacaggag gttacggg atg age aat teg cgc cgc cgc tea 

Met Ser Asn Ser Arg Arg Arg Ser 



112 



49 

50 



■30 



-25 



52 etc agg tgg tea tgg ttg ctg age gtg ctg get gee gtc ggg ctg ggc 

53 Leu Arg Trp Ser Trp Leu Leu Ser Val Leu Ala Ala Val Gly Leu Gly 

54 -20 -15 -10 

56 ctg gec acg gcg ccg gec cag gcg gec ccg ccg gec ttg teg cag gac 

57 Leu Ala Thr Ala Pro Ala Gin Ala Ala Pro Pro Ala Leu Ser Gin Asp 

58 -5 -11 5 

60 egg ttc gec gac ttc ccc gcg ctg ccc etc gac ccg tec gcg atg gtc 

61 Arg Phe Ala Asp Phe Pro Ala Leu Pro Leu Asp Pro Ser Ala Met Val 



160 



208 



256 
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62 10 15 20 

64 gcc caa gtg ggg cca cag gtg gtc aac ate aac acc aaa ctg ggc tac 304 

65 Ala Gin Val Gly Pro Gin Val Val Asn He Asn Thr Lys Leu Gly Tyr 

66 25 30 35 40 

68 aac aac gcc gtg ggc gcc ggg acc ggc ate gtc ate gat ccc aac ggt 352 

69 Asn Asn Ala Val Gly Ala Gly Thr Gly He Val He Asp Pro Asn Gly 

70 45 50 55 

72 gtc gtg ctg acc aac aac cac gtg ate gcg ggc gcc acc gac ate aat 400 

73 Val Val Leu Thr Asn Asn His Val He Ala Gly Ala Thr Asp lie Asn 

74 60 65 70 

76 gcg ttc age gtc ggc tec ggc caa acc tac ggc gtc gat gtg gtc ggg 448 

77 Ala Phe Ser Val Gly Ser Gly Gin Thr Tyr Gly Val Asp Val Val Gly 

78 75 80 85 

80 tat gac cgc acc cag gat gtc gcg gtg ctg cag ctg cgc ggt gcc ggt 496 

81 Tyr Asp Arg Thr Gin Asp Val Ala Val Leu Gin Leu Arg Gly Ala Gly 

82 90 95 100 

84 ggc ctg ccg teg gcg gcg ate ggt ggc ggc gtc gcg gtt ggt gag ccc 544 

85 Gly Leu Pro Ser Ala Ala He Gly Gly Gly Val Ala Val Gly Glu Pro 

86 105 110 115 120 

88 gtc gtc gcg atg ggc aac age ggt ggg cag ggc gga acg ccc cgt gcg 592 

89 Val Val Ala Met Gly Asn Ser Gly Gly Gin Gly Gly Thr Pro Arg Ala 

90 125 130 135 

92 gtg cct ggc agg gtg gtc gcg etc ggc caa acc gtg cag gcg teg gat 640 

93 Val Pro Gly Arg Val Val Ala Leu Gly Gin Thr Val Gin Ala Ser Asp 

94 140 145 150 

96 teg ctg acc ggt gcc gaa gag aca ttg aac ggg ttg ate cag ttc gat 688 

97 Ser Leu Thr Gly Ala Glu Glu Thr Leu Asn Gly Leu He Gin Phe Asp 

98 155 160 165 

100 gcc gcg ate cag ccc ggt gat teg ggc ggg ccc gtc gtc aac ggc eta 7 36 

101 Ala Ala He Gin Pro Gly Asp Ser Gly Gly Pro Val Val Asn Gly Leu 

102 170 175 180 

104 gga cag gtg gtc ggt atg aac acg gcc gcg tec gat aac ttc cag ctg 784 

105 Gly Gin Val Val Gly Met Asn Thr Ala Ala Ser Asp Asn Phe Gin Leu 

106 185 190 195 200 

108 tec cag ggt ggg cag gga ttc gcc att ccg ate ggg cag gcg atg gcg 832 

109 Ser Gin Gly Gly Gin Gly Phe Ala He Pro He Gly Gin Ala Met Ala 

110 205 210 215 

112 ate gcg ggc cag ate cga teg ggt ggg ggg tea ccc acc gtt cat ate 880 

113 He Ala Gly Gin He Arg Ser Gly Gly Gly Ser Pro Thr Val His He 

114 220 225 230 

116 ggg cct acc gcc ttc etc ggc ttg ggt gtt gtc gac aac aac ggc aac 928 

117 Gly Pro Thr Ala Phe Leu Gly Leu Gly Val Val Asp Asn Asn Gly Asn 

118 235 240 245 

120 ggc gca cga gtc caa cgc gtg gtc ggg age get ccg gcg gca agt etc 976 

121 Gly Ala Arg Val Gin Arg Val Val Gly Ser Ala Pro Ala Ala Ser Leu 

122 250 255 260 

124 ggc ate tec acc ggc gac gtg ate acc gcg gtc gac ggc get ccg ate 1024 

125 Gly He Ser Thr Gly Asp Val He Thr Ala Val Asp Gly Ala Pro He 

126 265 270 275 280 
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128 


aac 


teg 


gec 


acc 


gcg 


atg gcg gac gcg 


ctt 


aac 


ggg 


cat 


cat 


ccc 


ggt 


1072 


129 


Asn 


Ser 


Ala 


Thr 


Ala 


Met Ala Asp Ala 


Leu 


Asn 


Gly His 


His 


Pro Gly 




130 










285 




290 










295 






132 


gac 


gtc 


ate 


teg 


gtg 


acc tgg caa acc 


aag 


teg 


ggc 


ggc 


acg 


cgt 


aca 


1120 


133 


Asp 


Val 


He 


Ser 


Val 


Thr Trp Gin Thr 


Lys 


Ser 


Gly Gly 


Thr 


Arg 


Thr 




134 








300 




305 










310 








136 


ggg 


aac 


gtg 


aca 


ttg 


gec gag gga ccc 


ccg 


gec 


tga 


tttcgtcgcg 




1166 


137 


Gly Asn Val 


Thr 


Leu 


Ala Glu Gly Pro 


Pro 


Ala 














138 






315 






320 


















140 


gataccaccc 


gccggccggc caattggatt ggcgccagcc 


gtgattgccg 


cgtgagcccc 


1226 


142 


cgagttccgt 


ctcccgtgcg cgtggcatcg tggaagcaat 


gaacgaggca 


gaacacagcg 


1286 


144 


tcgagcaccc 


tcccgtgcag ggcagtcacg tegaaggegg 


tgtggtcgag 


catceggatg 


1346 


146 


ccaaggactt 


cggcagcgcc gccgccctgc ccgecgatcc 


gacctggttt 


aagcacgccg 


1406 


148 


tcttctacga 


ggtgctggtc egggegttet tcgacgccag 


cgcggacggt 


tccggcgatc 


1466 


150 


tgcgtggact 


catcgatcgc ctcgactacc tgcagtggct 


tggcatcgac 


tgcatctggt 


1526 


152 


tgccgccgtt 


ctacgactcg ccgctgcgcg aeggeggtta 


cgacattcgc 


gacttctaca 


1586 


154 


aggtgctgcc 


egaattegge accgtcgacg atttegtege 


cctggtcgac 


gccgctcacc 


1646 


156 


ggcgaggtat 


ccgcatcatc accgacctgg tgatgaatca 


cacctcggag 


tcgcacccct 


1706 


158 


ggtttcagga 


gtcccgccgc gacccagacg gaeegtaegg 


tgactattac 


gtgtggagcg 


1766 


160 


acaccagcga 


gcgctacacc gacgcccgga teatcttegt 


cgacaccgaa 


gagtcgaact 


1826 


162 


ggtcattcga 


tcctgtccgc cgacagttct actggcaccg 


attctt 








1872 


165 


<210> SEQ ID NO 


: 2 




















166 


<211> LENGTH: 355 




















167 
168 


<212> TYPE: PRT 
<213> ORGANISM: 


Mycobacterium tuberculosis 




VZ2o7 


169 


<223> OTHER 


INFORMATION: 32 KD serine protease 1 


4TB32A 








171 


<400> SEQUENCE: 


2 




















172 


Met 


Ser 


Asn 


Ser 


Arg 


Arg Arg Ser Leu 


Arg 


Trp 


Ser 


Trp 


Leu 


Leu 


Ser 




173 


1 








5 




10 










15 






174 


Val 


Leu 


Ala 


Ala 


Val 


Gly Leu Gly Leu 


Ala 


Thr 


Ala 


Pro 


Ala 


Gin 


Ala 




175 








20 




25 










30 








176 


Ala 


Pro 


Pro 


Ala 


Leu 


Ser Gin Asp Arg 


Phe 


Ala 


Asp 


Phe 


Pro 


Ala 


Leu 




177 






35 






40 








45 










178 


Pro 


Leu 


Asp 


Pro 


Ser 


Ala Met Val Ala 


Gin 


Val 


Gly 


Pro 


Gin 


Val 


Val 




179 




50 








55 






60 












180 


Asn 


He 


Asn 


Thr 


Lys 


Leu Gly Tyr Asn 


Asn 


Ala 


Val 


Gly 


Ala 


Gly 


Thr 




181 


65 










70 




75 










80 




182 


Gly 


He 


Val 


He 


Asp 


Pro Asn Gly Val 


Val 


Leu 


Thr 


Asn 


Asn 


His 


Val 




183 










85 




90 










95 






184 


He 


Ala 


Gly 


Ala 


Thr 


Asp He Asn Ala 


Phe 


Ser 


Val 


Gly 


Ser 


Gly 


Gin 




185 








100 




105 










110 








186 


Thr 


Tyr 


Gly 


Val 


Asp 


Val Val Gly Tyr 


Asp 


Arg 


Thr 


Gin 


Asp 


Val 


Ala 




187 






115 






120 








125 










188 


Val 


Leu 


Gin 


Leu 


Arg 


Gly Ala Gly Gly 


Leu 


Pro 


Ser 


Ala 


Ala 


He 


Gly 




189 




130 








135 






140 












190 


Gly 


Gly 


Val 


Ala 


Val 


Gly Glu Pro Val 


Val 


Ala 


Met 


Gly 


Asn 


Ser 


Gly 




191 


145 










150 




155 










160 




192 


Gly 


Gin 


Gly 


Gly 


Thr 


Pro Arg Ala Val 


Pro 


Gly 


Arg 


Val 


Val 


Ala 


Leu 




193 










165 




170 










175 
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194 


Gly 


Gin 


Thr 


Val 


Gin 


Ala 


Ser 


Asp 


Ser 


Leu 


Thr Gly Ala 


Glu 


Glu 


Thr 


195 








180 










185 










190 






196 


Leu 


Asn 


Gly 


Leu 


He 


Gin 


Phe 


Asp 


Ala 


Ala 


He 


Gin 


Pro 


Gly 


Asp 


Ser 


197 






195 










200 










205 








198 


Gly 


Gly 


Pro 


Val 


Val Asn Gly 


Leu Gly 


Gin 


Val 


Val 


Gly 


Met 


Asn 


Thr 


199 




210 










215 










220 










200 


Ala 


Ala 


Ser 


Asp 


Asn 


Phe 


Gin 


Leu 


Ser 


Gin 


Gly Gly Gin 


Gly 


Phe 


Ala 


201 


225 










230 










235 










240 


202 


He 


Pro 


He 


Gly 


Gin 


Ala 


Met 


Ala 


He 


Ala 


Gly Gin 


He 


Arg 


Ser 


Gly 


203 










245 










250 










255 




204 


Gly 


Gly 


Ser 


Pro 


Thr 


Val 


His 


He Gly 


Pro 


Thr 


Ala 


Phe 


Leu 


Gly 


Leu 


205 








260 










265 










270 






206 


Gly 


Val 


Val 


Asp 


Asn Asn Gly 


Asn 


Gly 


Ala 


Arg 


Val 


Gin 


Arg 


Val 


Val 


207 






275 










280 










285 








208 


Gly 


Ser 


Ala 


Pro 


Ala 


Ala 


Ser 


Leu 


Gly 


He 


Ser Thr Gly 


Asp 


Val 


He 


209 




290 










295 










300 










210 


Thr 


Ala 


Val 


Asp 


Gly Ala 


Pro 


He 


Asn 


Ser 


Ala 


Thr 


Ala 


Met 


Ala 


Asp 


211 


305 










310 










315 










320 


212 


Ala 


Leu 


Asn 


Gly 


His 


His 


Pro 


Gly Asp 


Val 


He 


Ser 


Val 


Thr 


Trp 


Gin 


213 










325 










330 










335 




214 


Thr 


Lys 


Ser 


Gly 


Gly Thr Arg 


Thr 


Gly 


Asn 


Val 


Thr 


Leu 


Ala 


Glu 


Gly 


215 








340 










345 










350 






216 


Pro 


Pro 


Ala 




























217 






355 




























220 


<210> SEQ ID NO 


3 
























221 


<211> LENGTH: 396 
























222 


<212> TYPE: 


DNA 


























223 


<213> ORGANISM: 


Mycobacterium tuberculosis 












225 


<220> FEATURE: 


























226 


<22 3> OTHER 


INFORMATION 


14 


KD C-terminal fragment of MTB32A Ra: 


228 


<220> FEATURE: 


























229 


<221> NAME/KEY: 


CDS 
























230 


<222> LOCATION: 


(1) 


.(396) 




















231 


<223> OTHER 


INFORMATION 


Ral2 


















233 


<400> SEQUENCE: 


3 
























234 


acg 


gcc 


gcg 


tec 


gat 


aac 


ttc 


cag 


ctg 


tec 


cag 


ggt 


ggg 


cag 


gga 


ttc 


235 


Thr 


Ala 


Ala 


Ser 


Asp 


Asn 


Phe 


Gin 


Leu 


Ser 


Gin 


Gly 


Gly 


Gin 


Gly 


Phe 


236 


1 








5 










10 










15 




238 


gcc 


att 


ccg 


ate 


ggg 


cag 


gcg 


atg 


gcg 


ate 


gcg 


ggc 


cag 


ate 


cga 


teg 


239 


Ala 


He 


Pro 


He 


Gly 


Gin 


Ala 


Met 


Ala 


He 


Ala 


Gly 


Gin 


He 


Arg 


Ser 


240 








20 










25 










30 






242 


ggt 


ggg 


ggg 


tea 


ccc 


acc 


gtt 


cat 


ate 


ggg 


cct 


acc 


gcc 


ttc 


etc 


ggc 


243 


Gly 


Gly 


Gly 


Ser 


Pro 


Thr 


Val 


His 


He 


Gly 


Pro 


Thr 


Ala 


Phe 


Leu 


Gly 


244 






35 










40 










45 








246 


ttg 


ggt 


gtt 


gtc 


gac 


aac 


aac 


ggc 


aac 


ggc 


gca 


cga 


gtc 


caa 


cgc 


gtg 


247 


Leu 


Gly 


Val 


Val 


Asp 


Asn 


Asn 


Gly 


Asn 


Gly 


Ala 


Arg 


Val 


Gin 


Arg 


Val 


248 




50 










55 










60 










250 


gtc 


ggg 


age 


get 


ccg 


gcg 


gca 


agt 


etc 


ggc 


ate 


tec 


acc 


ggc 


gac 


gtg 


251 


Val 


Gly 


Ser 


Ala 


Pro 


Ala 


Ala 


Ser 


Leu 


Gly 


He 


Ser 


Thr 


Gly 


Asp 


Val 



48 



96 



144 



192 



240 
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